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Abstract 

We analyze and study the effects of locality on the fault-tolerance threshold for quantum computation. 
We analytically estimate how the threshold will depend on a scale parameter r which characterizes the scale- 
up in the size of the circuit due to encoding. We carry out a detailed semi-numerical threshold analysis for 
concatenated coding using the 7-qubit CSS code in the local and the 'nonlocal' setting. First, we find that the 
threshold in the local model for the [[7, 1, 3]] code has a 1/r dependence, which is in correspondence with 
our analytical estimate. Second, the threshold, beyond the 1/r dependence, does not depend too strongly 
on the noise levels for transporting qubits. Beyond these results, we find that it is important to look at more 
than one level of concatenation in order to estimate the threshold and that it may be beneficial in certain 
places, like in the transportation of qubits, to do error correction only infrequently. 
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I. INTRODUCTION 

The issue of fault-tolerance is central to the future of quantum computation. Most studies of 
fault-tolerance until now III Q, U] have focused on deriving fault-tolerance in a setting where 
gates between any two qubits can be executed instantaneously, i.e. without taking into account the 
potential necessity to move qubits close together in space prior to gate execution. We call this 
setting the nonlocal model. Current estimates of the fault-tolerance threshold in the probabilis- 
tic independent nonlocal error model can be found in the extensive studies performed by Steane 
Jfl, estimating the threshold failure probability as 0(1CT 3 ). The recent results by Knill [6] and 
Reichardt [7] even give estimates that can be an order of magnitude better, i.e. O(10~ 2 ). 

nnn 

It has been argued, see [1, 5, 8] and the analysis in [9], that the local model, where qubit 
transportation is required, would still allow for a fault-tolerance threshold, albeit somewhat lower 
than in the nonlocal model. However, there has not been any assessment of how exactly locality 
influences the threshold, i.e. what is the dependence on the code, the spatial size of the error cor- 
rection procedure, the failure rates on the qubit wires, etc. Such an assessment is timely, because 
the post-selected schemes by Knill J^] in which large entangled states are prepared in a trial-and- 
error fashion (and to a smaller certain extent also the ancilla preparation procedure proposed by 
Reichardt O]) may fare worse compared to the more 'conventional' methods of computation and 
error correction when locality is taken into account. This is because the method of post-selection 
is based on attempting to create many states in parallel, of which a few may pass the test and 
are integrated in the computation. If the success probability is low, then at no additional cost in 
the nonlocal model, one can increase the number of parallel tries of creating these states. In the 
local model, however, it must be taken into account that an increase in the number of parallel tries 
increases the amount of qubit movement, and thus the potential for errors. 

In the first part of this paper, we make a purely analytical estimate of the threshold when locality 
is taken into account and show its dependence on a scale factor r, which is a measure of the spatial 
scale-up that is due to coding. This estimate can be applied to all known error models for which a 
fault-tolerance threshold result currently exists. 

Since this estimate may be very rough, we set out in the second part of this paper to analyze and 
compare, using the 'conventional' method of error correction as described by Steane in [5], the 
fault-tolerant behavior for the concatenated 7-qubit CSS [[7, 1, 3]] code for the local and nonlocal 
model. 
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In our analysis, we focus on concatenated coding and the threshold result. This is not to say 
that the strategy of using a large code once so that logical failure rates are small enough for the 
type of computation that we envision (see lloL may not be of equal or greater practical interest. 
In such a scenario, one 'merely' has to optimize the error correction procedures and encoded gate 
operations for locality. 

Here are some of our semi-analytical findings for the 7-qubit code. In these studies we have 
used the nonlocal error correction routine and have looked at the effects of the noise level during 
transportation of qubits and the scale-up of the computation due to coding. 

• In the entirely nonlocal setting, we find that one really needs to look at higher levels of 
concatenation to estimate a correct threshold. For the model where all gates have the same 
failure probability 7 e ; se and memory errors are one-tenth of the gate failure probabilities 
lw = 7eZse/10, we find a threshold value of 7 e / se = 3.4 x 1CT 4 . This is smaller than what 
Steane estimates in Ref. [5]. 

• We find that, in the local setting, the threshold scales as 6(l/r). For example, for r = 20 
and for the failure of movement over a unit distance equal to the failure probability 7 e / se , 
and for memory errors equal to one-tenth of 'jeise, we find that the threshold value for j e i se 
is 7.3 x 10" 5 . 

• We find that the threshold does not depend very strongly on the noise levels during trans- 
portation. 

• We find that infrequent error correction may have some benefits while qubits are in the 
'transportation channel'. 



II. A LOCAL ARCHITECTURE 



Let us first discuss the existence of a fault-tolerance threshold in the local model of quantum 
computation. It is clear that for unencoded computations an at most a linear (in the number of 
qubits) overhead is incurred in order to make gates act on nearest-neighbor qubits. 

If we perform concatenated coding in order to decrease the logical failure rate, we note that 
the circuit grows in size exponentially in the level of concatenation. Therefore, the distances over 
which qubits have to be transported (see JlS]) and thus the number of places in time and space 
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where errors can occur will increase. This will inevitably increase the logical failure rate at the 
next level of concatenation as compared to the logical failure rate in the nonlocal model. In order 
to be below the noise threshold, we want the logical failure rate to decrease at higher levels of 
concatenation. Thus it becomes a question of whether the extra increase in logical failure rate due 
to locality is sufficiently bounded so that there is still a noise value below which the logical failure 



rate decreases at t 
the literature, see 



le next level of concatenation. The question has been answered positively in 
jlS. In particular, in Ref. |9], two simple, significant observations were made 



which are important in deriving the existence of a threshold in local fault-tolerant computation: 

1. The most frequent operations during the computation should be the most local operations. 
For concatenated computation, the most frequent operation is lowest-level error correction. 
Thus the ancillas needed for this error correction should be adjacent to the qubits that are 
being corrected. The next most frequent is level 1 error correction, and so on. In Fig. d an 
example of a layout following these guidelines is given (see also BSD itself). 

2. The circuitry that replaces the nonlocal circuitry, say an error correction routine or an en- 
coded gate operation, should be made according to the rules of fault-tolerance. For example, 
it is undesirable to swap a data qubit with another data qubit in the same block, since a failure 
in the swap gate will immediately produce two data errors. Local swapping could potentially 
be done with dummy qubits, whose state is irrelevant for the computation. 



The third observation, which is less explicitly stated in Ref. (2D, is based on the following. Let 
us assume that we follow the requirement for hierarchically putting error correction ancillas near 
the data. We first start by making the original circuit a circuit with only nearest-neighbor gates 
according to the specific architecture. We call this circuit M and concatenate once to obtain circuit 
Mi, twice to obtain circuit M 2 , etc. In circuit Mi, we have replaced qubits from M by encoded 
qubits and their ancilla qubits for error correction (or local gate operations). Thus every qubit 
becomes a 'blob' of qubits with a certain spatial size. In order to do a two-qubit gate g from M , 
we have to move the data qubits in this blob past or over these ancillary qubits in order to interact 
with other data qubits (see J3]). Let us say that the scale of the blob is given by a parameter r so 
that in order to do the encoded two-qubit gate the qubits have to be moved over a distance r. At the 
next level of concatenation, again every qubit 'point' becomes a blob, which implies that in order 
to do the doubly encoded version of g E M , a doubly encoded block has to move over distance 
r 2 . The two-qubit gates in the error correction of M 1 involving the level 1 error-correcting ancillas 
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have to be moved over a distance r and the level error-correcting ancillas, which are added in 
M 2 , are 'local', assuming that we made the error correction routine itself local. Thus in general, 
in M n , level k ancillas, k — 0, . . . , n — 1, may have to be moved over a distance which scales as 
r k , exponential in the number of levels of concatenation. 

Let us assume that the failure probability of a travelling qubit is approximately linear in distance 
d, i.e. p err = 1 — (1— p) d ~ dp where p is the failure probability per unit distance. For many 
implementations, the distances involved in moving level k ancillas, as well as the failure rates, 
will be far too large and error correction will have to be done frequently while the qubits are in 
transit. In fact, a threshold will probably not even exist if there is no error correction done in 
transit. This is because at some level of concatenation the failure rates for the high-level ancillas 
are such that these ancillas completely decohere in transit. At that point, any additional level of 
concatenation can only make things worse, not better. In Section|InJ we give the details of a model 
where (lower-level) error correction on 'moving qubits' is included in the concatenation steps. 

If we think about realistic architectures for any type of physical implementation, it is likely that 
the stationary qubits lie in a one-dimensional, two-dimensional, or a few stacks of two-dimensional 
planes, potentially clustered in smaller groups. The reason is that one likely needs the third di- 
mension for the classical controls that act on the qubits as in ordinary computation. 

Given the discussion above, we can imagine a two-dimensional layout of qubits as in Fig.[T] In 
Mi, every block of data qubits surrounds stationary level ancillas, indicated by the white area. 
The data qubits themselves have to be moved (over distance r) either out of the plane, or by 'wires' 
in the plane, in order to interact with the nearest-neighbor block of data qubits. In M 2 , we again 
have the stationary 'white' level ancillas, light gray areas for level 1 ancillas that now have to be 
moved over distance r, and the dark gray areas for data qubits which potentially have to be moved 
over distance r 2 . 

In this paper, we do not go into details about the mechanisms behind qubit movement. Inside 
the error correction procedure, depending on the implementation, one may think about swapping 
qubits or creating short-ranged EPR pairs in order to teleport qubits. For the longer distances, 
one may create a grid of EPR pairs, using quantum repeater techniques which is maintained 
by frequent entanglement distillation, or alternatively convert stationary qubits into more mobile 
forms of qubits (photons, spin-waves, etc.). In Section|inl we lay out a model for error correction 
'along the way', but we do not discuss how or where in space this additional error correction can 
take place. This could be the subject of future research. 
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FIG. 1: Two-dimensional plane with the spatial layout of Mi and Mi- The grayness of the area indicates 
the amount of moving the qubits potentially need to do. 



HI. LOCAL FAULT-TOLERANCE: AN ANALYTIC LOWER BOUND 

We follow the derivation of fault-tolerant quantum computation as in Ref. 0, 

which has also 

been used in J4 ] to deal with more general error models such as non-Markovian noise. 

We denote the original quantum circuit as M , consisting of iV locations. Each location is 
denoted by a triple ({g , • • • , Qi}, U, t), where the set of qj, 1 < j < 2, are the qubits involved in 
the operation U at time t. U is restricted to one- and two-qubit gates for simplicity and can be the 
identity operation. We fix a computation code C which encodes one qubit in m qubits. To achieve 
a fault-tolerant circuit, we concatenate this code recursively n times to create the circuit M n that 
simulates, to n levels of concatenation, the original circuit M . 
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The main change that occurs when including locality constraints in the fault-tolerance deriva- 
tion is that additional 'move' operations and error correction needs to be added. Secondly, the 
error correction procedure needs to be made local. How the latter task is done and what overhead 
is required will very much depend on the code. We will not focus on this issue in this paper. 

Consider a particular example of a location, for example a two-qubit gate. This gate gets 
replaced by a so-called 1 -rectangle in M 1 , which consists of error correction on both blocks of 
qubits followed by the encoded gate operation, shown in Fig. O 
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FIG. 2: The replacement rule for a two-qubit gate U. The dashed box represents a 1 -rectangle. £ represents 
the error correction procedure. U represents the encoded, fault-tolerant implementation of U. 



In the local model, this replacement rule that we repeatedly apply to obtain the circuit M n gets 
modified as in Fig.|3j While one block gets moved over a distance r, which we denote as a move(r) 
operation, the other block is waiting. Next, the fault-tolerant implementation of the original gate 
is executed locally and then the block is moved back in place. We precede the move and wait 
operations by an error correction routine, just as for the gate U . The model that we consider here 
assumes that the error levels induced by moving over distance r may be similar to the error levels 
due to the execution of the gate U. If moving is more error-prone, we may divide the distance r 
into shorter segments of length d,r = rd, and error correct after every segment if necessary. This 
modification and its effects will be considered when we make our detailed analysis in Section IvTl 

We see that in the local model each location in M n _i gets replaced by potentially more than 
one 'elementary' 1-rectangle in M n . Since this set of rectangles forms a logical unit, we will call 



A Replacement Rules 
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FIG. 3: The replacement rule for a local two-qubit gate U. Each dashed box represents an elementary 1- 
rectangle. £ represents the error correction procedure. U represents the local fault-tolerant implementation 
of U. The replacement circuit, i.e. the composite 1-rectangle, contains five elementary 1-rectangles. 

the sequence of elementary 1-rectangles a composite 1-rectangle. 

In the next section, we derive a rough lower bound on the threshold in the local model, depend- 
ing on a scale parameter r. 



A. Replacement Rules 

We formulate replacement rules for all possible other locations in the local model. We only 
consider locations that occur in the [[7, 1, 3]] code. Additional rules may have to be formulated 
for other codes, but the threshold estimate in this section will not depend on these details. We 
assume in formulating these replacement rules that a one-qubit gate is never executed in parallel 
with a two-qubit gate (this is correct for the [[7, 1, 3]] code that we study in Section fVTb : this means 
that the execution of the one-qubit gate is not delayed by the additional moving required for the 
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two-qubit gate. Note that we have two types of memory locations, which we call wait locations, 
depending on the type of gate (one- or two-qubit gate) occurring in the same time slice. The figures 
depicting these rules can be found in Appendix A. Here is a list of the distinct locations in the local 
model, also listed in Table II VI in Section llVAl and their replacement rules: 

1 . a one-qubit gate, depicted in Fig.fTTl 

2. a one-qubit gate followed by a measurement, depicted in Fig.[l2j We group a measurement 
with a one-qubit gate, since the replacement rule for a measurement by itself is just doing 
m measurements on m encoded qubits. 

3. a two-qubit gate U, depicted in Fig.|3j 

4. a wait location in parallel with only one-qubit gates, denoted as waitl or wl. The replace- 
ment rule is the same as for a one-qubit gate CFig-fTTTl. 

5. a wait location in parallel with two-qubit gates, denoted as wait2 or w2, depicted in Fig. [13] 

6. move(r), the operation which moves one qubit over distance r, where r depends on code 
properties, depicted in Fig. [141 

7. wait(r), the operation which does nothing while another qubit moves over distance r, de- 
picted in Fig.fT3I 

Note that our replacement rules enforce synchronization of gate operations and waiting periods. 
Note that at each new level of concatenation, every distance gets multiplied by the scale factor r, 
so that a move(r) gate becomes r move(r) 1-rectangles. We would like to stress that the goal here 
has been to choose a set of level-independent replacement rules that capture the overall behavior; 
architecture, code-dependent and concatenation-dependent optimizations are not considered. 

In order to apply the rules repeatedly, the encoded gate U is broken down into local elementary 
gates (potentially using additional swap gates) and the replacement rules are applied to these local 
gates. 



B. Threshold Estimate 

As was noted in Ref. J12I1 and explicitly stated in Ref. [4], the formal derivation of a fault- 
tolerance threshold hinges on three Conditions (under the usual assumptions of having fresh an- 
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cillas and being able to operate gates in parallel). Fault-paths are subsets of all locations on which 
faults occur. The conditions are, loosely speaking, the following: 

1. 'Sparse' fault-paths (with few faults) lead only to sparse errors. 

2. 'Sparse' fault-paths give good final answers. 

3. Non- sparse fault-paths have small probability /norm, going to zero with increasing concate- 
nation level for initial failure probabilities/norms per location below some threshold value. 

The first two statements are unchanged when going from a nonlocal to a purely local model of 
computation, assuming that the error correction routines are made local in a fault-tolerant manner. 
It is the third Condition whose derivation gets modified in this model. For concreteness, let us 
assume that our error model is a probabilistic error model, where each location undergoes a failure 
with some probability 7(0). At an intuitive level, every location gets replaced by a composite 
1 -rectangle, which fails when at least one of the elementary 1 -rectangles fails. If we assume that 
every type of 1 -rectangle has a similar failure probability 7(1), then the composite 1 -rectangle 
which is most prone to failure is the one originating from the move(r) gate (r >> 5) since it 
consists of r elementary 1 -rectangles. In order to be below the threshold, the failure probability 
of the composite 1 -rectangle has to be smaller than the failure probability of the original location, 
i.e. 

7o = 7 (0)>l-(l- 7 (l)) r « 7 (l)r. (1) 

Let us assume that A^ c is (an upper bound on) the number of locations in an elementary 1- 
rectangle that has been made local. We say that a 1 -rectangle fails if, say, more than k among 
these locations have faults. Here k = [d/2t\ for a code with spread t which can correct d errors. 
Thus 7(1) PS (fc+i)7(0) fc+1 and we get the threshold condition 



70 C rit - 




The difference with the nonlocal model is the appearance of r on the right-hand side of this equa- 
tion. Note that the effects of locality seem to become effectively smaller for large k, i.e. for codes 
that can correct many errors. On the other hand, the scale factor r itself increases for codes that cor- 
rect many errors, since the number of qubits in an encoded word and the size of the error-correcting 
machinery is larger. The [[7, 1, 3]] code that we analyze in more detail in the next section does not 
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entirely fit this analysis. The reason is that for the [[7, 1, 3]] code, d = 1 and t = 1, causing k to 
be zero; this is because one can have one incoming error (a late error in the previous rectangle) 
and one early error in the rectangle, leaving two errors on the data, which [[7, 1, 3]] cannot correct. 
However a different analysis II 1 311 for such codes shows that one-error events in bigger 'overlap- 
ping' rectangles (which include error-correction, gate operation and error-correction again) are 
acceptable for this code, so k actually equals one. Thus we expect for the [[7, 1, 3]] code that the 
threshold for the local model scales as 1/r, which we partially confirm later. 

A more formal analysis uses the notion of n-rectangles in M n . We state the definitions as given 
in Ref. in Appendix 151 In M n , a composite n-rectangle originates from a single location 
in Mq. The composite n-rectangle consists of at most r elementary n-rectangles. Each of these 
elementary n-rectangles consist of at most Ai t c composite (n — 1) -rectangles, each of which again 
consists of at most r elementary (n — 1) -rectangles. Formally, we need to prove Condition (3) 
above, namely that the probability (assuming a probabilistic model) for sparse 'good' faults gets 
arbitrarily close to one when we are below the threshold. Here we state the necessary lemma, 
which has identical structure to the one in lfl2l : 

Lemma 1 If jo < 7o cr it> 3<5 > such that the probability P(n) for the faults in a composite 
n-rectangle to be (n, k)-sparse is larger than 1 — 7q 1+< ^ • 



r[^M +l <ll +& - (3) 



Proof: Let 5 be such that 

r I 

K k + 1 

For 7 below the threshold, we can find such a 5. We prove the lemma by induction on n. The 
probability for a composite 1 -rectangle to have (1, k) -sparse faults, i.e. all elementary 1 -rectangles 
(of which there are at most r) have sparse faults, is at least 

Al,C V Jfc+A „ ( Al,C \ k+1 . -i 1+5 



k+1J ir) >l-^ fc -j7„->l-7„-, (4) 

using Eq. ©. Assume the lemma holds true for n and we prove for n + 1. For the faults in 
a composite (n + 1) -rectangle not to be (n + 1, k) sparse, there must at least be 1 elementary 
(n + l)-rectangle in which the faults are not (n + 1, fc)-sparse which implies that in that rectangle 
there are at least k + 1 composite n-rectangles which are not (n, k) -sparse. Thus, 

P(n + 1) > fl - (1 - P(n)) fc+1 ) ' > 1 - r (1 - F(n)) k +\ (5) 
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Using the induction hypothesis and Eq. © then gives 

P(n + l)>l-( 7o ) (1+5) " + \ (6) 

as desired. □ 

We note that a similar analysis could be performed for any other noise model which is derived 
with the method used in Ref. lb?! , such as noise satisfying the exponential decay conditions or 
local non-Markovian noise The proof of Condition (3) in these cases needs to be altered to 
take into account the dependence on r. 

IV. NONLOCAL FAULT-TOLERANCE FOR THE 7-QUBIT [[7, 1,3]] CODE 

In order to make a good comparison between using a concatenated [[7, 1, 3]] code in the local or 
nonlocal model, we perform a fault-tolerance analysis for the nonlocal model. In Ref. [5], Steane 
performed such an analysis and we follow his analysis to a certain extent. At the end of this 
section, we summarize our findings for the nonlocal model. The goal is to produce a threshold in 
the right ballpark, taking into account various (but not all) details of the error correction circuitry. 
The details of error correction are depicted in Figs. [T6l - fT9l in Appendix ICl and can 

be described as follows. Error correction of a 7-qubit block consists of X- and Z -error correction 
denoted as X and Z. For both types of error correction, one prepares n rep ancillas, using the Q 
network in Fig. [T7] These ancillas are tested for X errors using the V network in Fig. fTSIand 
discarded if they fail the test. The probability for passing this test is called a. If they do pass the 
test, they can be used to collect the syndrome as in Fig. [T9l If the first collected syndrome is zero, 
then no further syndromes are collected (the idea being that it is likely that there is no error on the 
data). The probability for a zero syndrome is called (3. If the syndrome is nonzero, an additional 
s — 1 syndromes are collected. These s syndromes are then compared and if there are s' among 
them which agree, then error recovery, denoted by 1Z, is done according to this syndrome. If there 
are no s' which agree, no error correction is done and in our model (see [5] for modifications) we 
do not use these error syndromes in any subsequent error correction. 

Let us now consider the problem of determining the fault-tolerance threshold by semi-analytical 
means. At the base-level, we start with a vector of failure probabilities of the locations in our model 
which we call 7(0). In our case we have the following five kinds of locations /; a one-qubit gate 
(1 = 1) with failure probability 71 = 71 (0), a two-qubit gate (I = 2) with failure probability 72, 



14 



a wait location (I = w) with failure probability 7^, a one-qubit gate followed by measurement 
(/ = lm), with failure probability j lm , and a |0) preparation location with failure probability j p . 
Table[Qlists these types of locations in the nonlocal model. 



Location 


Description 


Failure Prob. 


1 


one-qubit gate 


7i 


2 


two-qubit gate 


72 


w 


memory (wait) 




lm 


one-qubit gate + measurement 


7lm 


P 


preparation 


7p 



TABLE I: Types of locations and their failure probability symbols in the nonlocal analysis. 

There are several ways in which one can do a fault-tolerance analysis. The first method is 
to perform a Monte-Carlo simulation (see, for example, Jfl Q, [3]) of a sequence of operations 
for some level of concatenation and deduce a failure or crash probability. The advantage of this 
method is that it takes into account incoming errors into rectangles and then it otherwise exactly 
mimics the failure probability in the real quantum computation. The disadvantage, in particular 
for large codes, is that it is hard to simulate high levels of concatenation, since the size of the 
classical computation scales exponentially with concatenation level. As we discuss in a moment, 
and demonstrate in our studies, simulating more than one level of concatenation is often needed to 
nail down the threshold. 

The second method is a semi-analytical one, which we follow, to obtain an approximate proba- 
bility flow equation. Due to concatenation, each location is represented by a rectangle, which has 
some probability of failure, meaning that at the end of the rectangle there are more errors on the 
data than the code can correct. Thus after one level of concatenation, the probability vector 7(0) 
is mapped onto 7(1), and we repeat this procedure. We say that the original vector 7(0) is below 
the threshold if 7(71) — > for large enough n. The drawback of this kind of analysis is that careful 
approximations need to be made in order to estimate the failure probability function of a rectangle, 
since a complete analysis may be too complicated. Furthermore, the analysis does not deal so well 
with incoming errors, since we look at one 1 -rectangle at a time. The advantage is that it is easy 
to look at high levels of concatenation. 

In the next section, we approximate the failure probability function 7/(n) = ^(7(71 — 1)) for 
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the different types of 1 -rectangles. First, we describe the modelling assumptions we have chosen. 
A. Modelling Choices 

• We assume that the time it takes to do a measurement is the same as the one-qubit gate time 
and that classical post-processing does not take any additional time. 

• We have chosen to call a one-qubit gate followed by a measurement a single location. The 
reason is that there is no explicit concatenation step for measurement, since each measure- 
ment just gets replaced by seven measurements and classical post-processing to correct for 
errors. We choose to set the failure probability of a measurement 7 m = 7i- Thus the failure 
probability for the location Im is approximated as 7i m ~ 71 + 7™. As it turns out, there are 
no two-qubit gates followed by measurement in the [[7, 1, 3]] error correction routines, and 
a wait or memory location of any length followed by a measurement is just measurement, 
since there is no reason to wait. 

• A preparation of the state |0) is a preparation location with a preparation failure probability 
7 P . For simplicity, we may set 7 p = 7^ At the next level of concatenation, this location 
will be replaced by an encoding circuit. Preparing an encoded |0) can be done by first per- 
forming error correction on an arbitrary state which projects the state into the code space 
and then measuring the eigenvalue of the encoded Z operator fault-tolerantly and correct- 
ing if this eigenvalue is —1. Even though the last procedure, done fault-tolerantly, will be 
more involved than the execution of a transversal one-qubit gate, we assume that the en- 
coding/preparation rectangle is of the one-qubit gate type. In other words, we do not use a 
separate replacement rule for a preparation location. 

• We will typically work in the regime where 7^ < 7^2, perhaps an order of magnitude 
smaller. 

• We assume (here and in the local model) that our quantum circuit contains only controlled- 
Z (C z ), controlled-not (C x ), and Hadamard gates (H). Note that these can all be executed 
transversally. Of course, in order to make the computation universal, one would also need, 
e.g., a Toffoli gate or n/8 rotation. We believe that the inclusion of the n/8 gates would not 
alter the threshold in the local model very much. The reason is that (1) error-correction does 
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not need the n/8 gate and thus n/8 gates are fairly rare, (2) the n/8 gate, as a 1-qubit gate, 
can be executed locally and (3) the failure probability for the 1 -rectangle of the 7r/8-gate is 
probably similar or lower than that of the 2-qubit gate since it involves only one data block 
(and some ancillary state). As it turns out, already the inclusion of the two-qubit gates has a 
sizable effect on the threshold estimate. 

The error correction procedure as described in the previous section is not of fixed size; for 
example, it depends on the number of syndromes collected and whether or not we do a recovery 
operation. Here are some choices that we make which directly affect how we calculate the failure 
probability in the next section. These assumptions are not exactly the same as the ones made in 
Ref. Ji: 

• The procedures for error correction are of course parallelized as much as possible to reduce 
errors due to waiting. As can be seen in the figures, the syndrome collection network S (Fig. 
IT9t then takes 3 time steps, the network Q (Fig. ITTt has 5 time steps and V (Fig. IT8t has 
6 time steps. We assume that the 4 verification bits are prepared while the Q routine takes 
place. 

• We choose s, the maximum number of syndromes collected, to be s = 3 and s' = 2. 

• In every round of the computation, we assume that a nonzero syndrome occurs somewhere, 
so that in order to keep the network synchronized, the other data blocks have to wait for the 
additional s — 1 syndromes to be collected. We take these wait locations into account. 

• We assume that a sufficient number n rep of new ancillas is prepared in parallel before the 
beginning of each error correction routine. We set n rep = |~~] , so that on average we have 
enough ancillas for error correction. We assume that the ancillas are prepared during the 
previous error correction procedure so that the data does not have to wait in order to be 
coupled to the ancillas. These assumptions are a bit too optimistic, since a nonlocal ancilla 
preparation and verification routine, see Figs. \T7\ and [TH takes 11 time steps, while three 
syndrome collection routines, see Fig.[J21 take 9 time steps in total (and this will be worse 
in the local version of these procedures since ancillas have to be 'moved in place' to couple 
to the data). 

• We assume that the prepared ancillas for the last s — 1 syndrome collections have to wait 
before the previous syndrome collections are done. This could potentially be avoided, but 
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we may as well include some extra wait locations since other approximations may be too 
optimistic. 

• In principle, we may not have enough syndromes in agreement, so that no error correction is 
performed, and secondly we could have enough syndromes agreeing but the syndrome may 
be faulty so that we do a faulty recovery operation. The latter probability may be quite small 
since errors have to 'conspire' to make a faulty but agreeing syndrome, so we will neglect 
this source of errors. If we choose not to do error correction, we may have more incoming 
errors in the next routine; we do model incoming errors to some extent in our estimation of 
a and (3, but we will not consider this source of errors separately. 

• In the estimation of the failure probability we always assume that faults do not cancel each 
other. 

• We are working with the probabilistic error model where each gate or location can fail with 
a certain probability. For a location on a single qubit that fails with probability 7, we say 
that a X, Y or Z error occurs with probability 7/3. We will use this distinction between X, 
Y and Z errors in our estimation of a and (3 in the next section. 

B. Failure Probability 

For the [[7, 1, 3]] code failure of a 1 -rectangle means that two or more errors occur on data 
qubits during the execution of the operations in the 1 -rectangle. This could happen when we have 
a single incoming error and, say, a syndrome collection gate, such as C z , introduces an additional 
error on the data and the ancilla. In estimating the failure probability, we do not take into account 
incoming errors since below the threshold the probability for incoming errors should typically be 
small. The circuits are designed such that if there are no incoming errors and a single fault occurs 
in the 1 -rectangle, that fault will typically either not affect the data, or will be corrected. Only 
if the fault occurs late in the routine, say in the encoded gate operation, will the fault be passed 
on to the next error correction routine. Thus we assume that two faults affecting the data are 
needed for failure. First, let us consider those 1 -rectangles which involve a single data block, i.e. 
/ = 1, lm,p, w. Let Fi[s x , sj(7) be the failure probability for a rectangle of type / when s x and 
s z syndromes in resp. X and Z are calculated. We can write 

7i(n) = /3 2 F,[1, l](7(n - 1)) + 2/3(1 - /J)F,[ a , l](?(n - 1)) + (1 - /3) 2 F*[s, s}tf(n - 1)). (7) 



B Failure Probability 



18 



From now on, we will omit the dependence on concatenation level, i.e. we express F; in terms of 
7j. Let P(e + E T,s x ,s z ) be the probability of e or more faults on the data block due to source T 
when s x and s z syndromes are calculated. We may model 

P(l+ E T, s x , s z ) = 1 - (1 - 5(T)fF>*>"\ (8) 

where S(T) is the failure probability of the particular location (or event) in T which causes the 
fault and N(T, s x , s z ) counts the number of places in T where the fault can occur. Similarly, we 
have 

P(2+ E T, s x , s z ) = 1 - (1 - 5(T)) N <T> a "'J - 5(T)N(T, s x , s z )(l - 6 (T)) N ^ S ^\ (9) 

In Table ITT1 we describe the possible sources of faults on the data and their values for 5 and N. For 
failure to occur, we can typically have one fault due to source / and one due to source J or two 
faults due to source /. In other words, we approximate 

F^, s z ] « P ( 1+ e J ' s *> S -) P ( 1+ e J ' s *> s ^ + Yl P ( 2+ G J ' (10) 
I>J I 

Some of the parts of the first term give somewhat of an overestimate, since a single fault in, 

say, X and a single fault in Z does not necessarily lead to a failure. Also, note that we are 

overcounting some higher order fault-terms, but these should be small. Note that the / dependence 

of the right-hand side of Eq. (fTOb only appears in the terms that involve the faults due to encoded 

gate operations listed in Table ITTI Note that we do not distinguish between X,Y or Z errors in 

estimating the failure probability. 

For a / = 2 (C x or C z ) 1 -rectangle the analysis is slightly more involved. Let 

F[s Xl , s zi , s X2 , s Z2 ] be the failure probability of the two error correction routines on block 1 and 2 

when s xi and s zi syndromes are computed for block 1 and s X2 and s Z2 syndromes are computed 

for block 2 (without the subsequent gate operation). Let rrij E {0, 1} such that rrij = when 

Sj = s and rrij = 1 when Sj = 1, where j E {xi, X2, z±, z%\ and s is the number of syndrome 

measurements. We can then write 

72 (n) = ^m xl +m X2 +m zl +m Z2 x 

1^2^ )^ x 2 i^ z 2 = '® 

(1 _ (3Y-^-^-^'^Y[s Xl ,s Zl ,s X2 ,s Z2 ]{%n - 1)). (11) 

Let F(s x , s z ) be the failure probability of one error correction routine when s x and s z syndromes 
are calculated, i.e. it is Eq. (TTOb with the additional constraint that the source is never the encoded 
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Source 


5 


N 


Propagation from a verified ancilla with X error 


^anc 




Fault in C z or C x in S 


72 


7(s x + s z ) 


Memory faults on data at the end of S 


lw 


U(s x + s z ) 


Memory faults on data during 1Z 


lw 




Fault in gate of 1Z 


71 + Iws 


3s z ,S ~i~ &S X ,S 


Memory faults on data when s = 1 


Iw 


21(s-l){6 a , A + 5 8mtl ) 


X errors on ancillas waiting for S 


lw 


21s(s-l)(5 Sz , s + 5 Sx>s )/2 


Encoded gate error in rect. of type I 


11 


7 



TABLE II: Different sources of failure and their contribution to the failure probability. Here 5 anc = 1 — 
P(no X | pass) where P(no X \ pass) = P(pass and no X)/a and P(pass and no X) is the probability that 
an ancilla passed verification and has no X errors on it. This probability is estimated in Section llVCI The 
probability ^y ws for obtaining a wrong majority syndrome is assumed to be in our analysis. 

gate. Let P(l + G T, s xi , s Zl , s X21 s Z2 ) be the probability of one or more faults anywhere due to 
source T in the two error correction routines calculating s xi , s Zl , s X2 , s Z2 syndromes, that is, the 
number iV in Eq. © gets modified to N(T, s Xl , s zi , s X2 , s Z2 ) which is similar to the ones in Table 
HH except that we add the contributions from both error corrections. Then for C A , where A = X 
or A = Z, we approximate 

F(s Xl ,s Zl ,s X2 , s Z2 ) w P(2 + e C A ) + 7 72 (1 - 72 ) 6 ^P(l+ e I, s Xl ,s Zl ,s X2 ,s Z2 ) + 

I^G 

(l- l2 y[F(s Xl ,s Zl ) + F(s X2 ,s Z2 )]. (12) 

The first term represents the contribution from having two or more faults in the two-qubit gate, 
the second term represents one gate fault and one or more faults somewhere in the error correction 
routines and the third term represents no faults in the gates and two or more in either the error 
correction on block 1 or block 2. 

C. Estimation of a and fi 

Our next task is to provide estimates for a, the probability of an ancilla passing verification, 
and (3, the probability of obtaining a zero syndrome. One can find another estimation of a and 
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ft in Ref. [5]. Similar to the failure probability, a and ft are functions of concatenation level, i.e. 
Fj(7(n — 1)) involves the functions a(n — 1) = a(j(n — 1)) and ft(n — 1) = ft{^{n — 1)). In the 
following we omit the concatenation level dependence, i.e. we express a and ft in terms of ji. 

For CSS codes, error correction is performed in two steps. While X and Z errors are detected 
in only one of the two steps, Y errors contribute to both. Hence if X, Y, Z errors are equally likely, 
the probability to detect an error is 2/3p for each step, where p denotes the total error probability. 

In the following paragraph we will speak of events that are detected as X errors or Z errors. 
Thus if a Y error occurs this results in both an X and Z error event. 

The fraction a of ancillas that pass verification can be calculated as 

a = P(pass and no X) + P(pass and X) = 
P(pass and no X) + P(pass and no Z) — 
P(pass and no Z, no X) + P(pass and Z,X). (13) 



The last probability we approximate as P(pass and Z,X) « 0. The next table shows what types of 
errors should be avoided in order to have a passing ancilla and no X or no Z errors. 





prep. ver. bits 


H+meas. ver. bits 


from Q 


early wait in V 


late wait in V 


ver. wait in V 


P(pass and no Z) 


X,Z 


X 


X,Z 


X,Z 


Z 


X,Z 


P(pass and no X) 


Z 


X 


X 


X 


X 


Z 



TABLE III: Types of errors in various subroutines that should not occur when ancilla passes verification 
and should have no X or no Z errors. When we write Z, it implies that neither Z nor Y should occur, 
since Y is both an X and Z error. Late wait indicates the wait locations on ancilla qubits that are finished 
interacting with the verification qubits. Early wait locations indicate the wait locations that occur before the 
last interaction with the verification bits. Verification wait locations indicate the wait locations that occur 
on the verification qubits. Strictly speaking, for the contribution to P(pass and no Z) we should distinguish 
between early and late wait errors on the verification qubits; we approximate this by requiring no types of 
errors on the verification qubit wait locations. 

For the C z gates, the exact contributions from various errors is harder to estimate (one has to 
examine the cases more carefully), so we approximate this by saying that in order to have a passed 
ancilla and no Z or no X error on the ancilla, all C z gates have to have no errors. This implies 
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that 

P(pass and no Z) = (1 - 7p ) 4 (l - 7l ) 4 (l - 2 7lm /3) 4 x 

IW1 - 7i )^ e6) (l - 2W3) 26 (1 - 7w ) 6 (l - 72 ) 13 , (14) 

and, slightly different, 

P(pass and no X) = (1 - 2 7p /3) 4 (l - 2 7l /3) 4 (l - 2 7lm /3) 4 x 

n ieg (l - 2 7i /3)^ e ^(l - 2 7w /3) 32 (l - 72 ) 13 . (15) 

Assuming that none of the possible faults occurs, then we can say that 

P(pass and no Z, no X) « 1^(1 - li ) N ^ ieg ^\ (16) 

From these estimates we can calculate a. 

Next we approximate f3, the probability of obtaining a zero syndrome, in a X-error correction 
routine as 

P P(no Z errors on anc. | ancilla passed) x P(no Z errors on syn. due to S) x 

P(no X error coming into X). (17) 

We have 

P(no Z errors on anc. | ancilla passed) = P(pass and no Z)/ct. (18) 
It is easy to estimate 

P(no Z errors on syn. due to S) = (1 - 2 72 /3) 7 (l - 2 7im /3) 7 . (19) 

Thirdly, we have 

P(no incoming X error in X) = P(no incoming X error in Z) x 
[/3P(«Si e 2 leaves no X error)P(no X err. on waiting data) + 

(1 - /3)P(Si,2,..., s G ^ leave no X error)] , (20) 

What is the probability P(no incoming X errors in Z)l If we assume that the previous X did 
its job, i.e. removed the errors, the only source of error is the gate that was done after X. Since we 
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do not know which gate was performed, we assume that the most error-prone gate occurred. Since 
all gates in our model are transversal, we approximate 

P(no incoming X errors in Z) « (1 — 2(max7j)/3) 7 . (21) 

i 

We further estimate 

P(«Si G Z leaves no X error) = P(«Si gives no X errors on data) x 

P(no X errors on anc. | anc. passed). (22) 

where 

P(5i gives no X errors on data) = (1 - 27 2 /3) 7 . (23) 

Lastly, we have 

P(no X errors on anc. | anc. passed) = P(pass and no X)/ct. (24) 

We also estimate 

P(«Si,2,..., s G Z leave no X error) « P(5j G Z leave no X error)*. (25) 

This estimate does not include the fact that the prepared ancillas may have to wait (and degrade) 
until they are coupled to the data. If there is only one syndrome collection, the data may have to 
wait until other full syndrome collections are done. We take this into account with 

P(no X err. on waiting data) = (1 - 2 7w /3) 21{s " 1) . (26) 

Thus, using Eqs. (ITvb - (l26l) . we arrive at a closed formula for (5. 

V. NUMERICAL THRESHOLD STUDIES FOR THE NONLOCAL MODEL 

We have used the formulas for failure probabilities, a, and /? of the last two subsections to 
quantify the fault-tolerance threshold for the nonlocal model. We study the effect of the repeated 
application of the map F/(t*), namely the dependence of the parameters on concatenation level. 
This is a four-dimensional map — there are five probability variables, but under our assumptions 
71 and 7 P behave identically. This four-dimensional flow is of course impossible to visualize 
directly, but two-dimensional projections of these flows prove to be very informative. 
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FIG. 4: Flows of the one- and two-qubit failure rates under concatenation in the nonlocal model. We 
initially set 71 = 72 = 7p = 7m = 10 x 7^. Four starting values are shown, two below threshold and 
two above. The initial flow is evidently very similar regardless of whether the map is above or below 
threshold. The hyperbolic structure of the flow is controlled by an unstable fixed point of the map at 
7i = 1w = 7im = 7p = 0.69 x 10~ 4 , and 72 = 1.50 x 10~ 4 , shown as the black "star" symbol. Note that 
the line onto which these flows asymptote has 72 very close to 2 x 7!. 

In FigureslU-EDwe show three instances of such a projected flow in the 71 — 72 plane. In Fig.|U 
we have initially taken the memory failure probability to be 10% of the gate failure probability and 
one- and two-qubit gate failure probabilities to be equal; that is, prior to concatenation, we take 
7i = 72 = 7 P = 7m = 10x7 w . In Fig.0 we initially take 71 = 0.25 x 72 = 7p = 7m = IOX7™. In 
Fig-El we initially take 71 = 2.0 x 72 = j p = j m = 10 x j w . With these initial choices, we look at 
the flows as we concatenate the map. Figures |4|-|6] show the behavior as the threshold noise value 
is crossed. As is common in renormalization group flows, these have a hyperbolic character; the 
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FIG. 5: Flows of the one- and two-qubit failure rates under concatenation in the nonlocal model. We 
initially set 71 = 0.25 x 72 = 7 P = j m = 10 x j w . Four starting values are shown, two below threshold 
and two above. The initial flow is evidently very similar regardless of whether the map is above or below 
threshold. The hyperbolic structure of the flow is controlled by an unstable fixed point of the map at 
7i = 1w = 7im = 7p = 0.69 x 10~ 4 , and 72 = 1.50 x 10~ 4 , shown as the black "star" symbol. Note that 
the line onto which these flows asymptote has 72 very close to 2 x 71. 

flows all asymptote to a one-dimensional line (for which, as can be seen in the figures, 72 ~ Z^i). 
In Fig. |4j for all initial points up to 72 < 3.35 x 1CT 4 , the flows follow this line to the origin, 
indicating successful fault-tolerant computation; for all higher failure rates the flows asymptote to 
one, indicating the failure of error correction. 

The whole character of the flow is set by the presence of an unstable fixed point at the black 
star, at approximately 71 = 7™ = 7i m = 7 P = 0.69 x 10~ 4 , and 72 = 1.50 x 10" 4 in Figs.|U-0 
It is evident that the linearized map around this point has one positive (unstable) eigenvalue and 
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FIG. 6: Flows of the one- and two-qubit failure rates under concatenation in the nonlocal model. We 
initially set 71 = 2.0 x 72 = 7 P = 7 m = 10 x 7^. Four starting values are shown, two below threshold 
and two above. The initial flow is evidently very similar regardless of whether the map is above or below 
threshold. The hyperbolic structure of the flow is controlled by an unstable fixed point of the map at 
7i = lw = 7im = 7p = 0-69 x 10~ 4 , and 72 = 1.50 x 10~ 4 , shown as the black "star" symbol. Note that 
the line onto which these flows asymptote has 72 very close to 2 x 71. 



four negative ones. 

The threshold, of course, is not a single number; it is the separatrix between points in the four- 
dimensional space of failure probabilities that flow to the origin upon concatenation, and those that 
flow to one. This separatrix is a three-dimensional hypersurface. A one-dimensional cut through 
this hypersurface is shown in Fig. |7] This is shown in the plane of memory failure 7^ versus all 
other failures, with all these rates taken to be the same: 7 e ^ e = 71 = 72 = 7 P = 7m- The threshold 
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curve (indicated with black 'dot' symbols) is nearly approximated by a straight line. 
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FIG. 7: The threshold line and pseudothreshold curves shown in the plane defined by the memory failure 
rate 7^ and all other failure rates 7 e / se = 71 = 72 = 7p = 7m- The pseudothreshold is defined as the 
line along which one of the failure rates remains unchanged after the first iteration of the map; closer to 
the origin, this failure rate decreases, further away it increases. The pseudothresholds for 71, 72, and 7^ 
are shown. We note that along the line (dotted) for which j w = 0.1 x j e i se , a popular condition in earlier 
studies, the gate pseudothresholds, particularly for the one-qubit gate failure rate, are much higher than the 
true threshold. 



In Ref. q] it has been suggested that a reasonable estimate for the threshold can be obtained by 
finding the failure rate for which the error is unchanged after the first concatenation of the error- 
correcting code. Figures |4] and |7] indicate that this rule of thumb actually has limited value (see 
1 20]). For all plotted initial points in Fig. |4j the failure probabilities go down after one level of 
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concatenation. However, after one more level of concatenation, two of the failure probabilities go 
up again indicating that those two initial points were above threshold. 

In Fig. 13 we investigate this further by plotting three "pseudothresholds" along with the actual 
threshold curve. These pseudothresholds are the lines along which 71, 72, and 7™ are unchanged 
after one iteration of the map. Obviously, these three are very different from one another and 
from the true threshold curve. Rather than being straight, the pseudothresholds are very curved. 
They curve in to the origin for a very simple reason: if 71 and 72 are initially zero, then no matter 
what the value of j w (i.e. anywhere along the y-axis of the plot), 7! and 72 become nonzero after 
one iteration, so every point on the y-axis is above these pseudothresholds. The corresponding 
statements hold about the s-axis for the j w pseudothreshold. 

We note that, particularly in the region where 7^ << 'jeise, the 71 pseudothreshold is a very 
substantial overestimate of the true threshold. On the plot we indicate the line for which memory 
failure is one-tenth of gate failure, a situation studied extensively by Steane yD. The 71 pseu- 
dothreshold is around j e i se = 1.2 x 1CT 3 (near the threshold value estimated by Steane), while the 
true threshold is at 7 e / se = 0.34 x 1CT 3 , about a factor of four lower. Looking at a wider range of 
initial failure rate values, we find that the initial point 7(0) is below its true threshold whenever all 
of the 7's decrease on the first iteration of the map. However, this rule of thumb is much too con- 
servative — there are large regions of this plot for which one or more of the 7's initially increase, 
and yet we are below threshold. 

It appears that distinguishing logical one-qubit gate errors from logical two-qubit gate errors 
has an important quantitative effect on our threshold estimates; the 72 curve turns upward much 
more rapidly than the 71 curve if we are near but above the threshold, and, in the vicinity of the 
fixed point in Figs. |4] and 13 72 is twice as large as 71. We see that this factor of two arises from 
a very simple cause: the rectangle describing the replacement rule for the two-qubit gate, Fig.|3l 
has two error correction blocks that can fail. Is this factor of two simply an artifact of how we 
group the encoded computation into rectangles? It is clear that the answer is no; for the two-qubit 
gate, the key fact is that the failure of either error-corrected block will cause the entire encoded 
two-qubit gate, and the two encoded qubits emerging from it, to be faulty. It appears that this is 
the key reason that the differing behavior of one- and two-qubit gates under concatenation should 
be taken into account. 

For memory errors, the story is rather different: we see that for large parts of Fig. |7] which 
are below threshold, 7^ increases (substantially, in fact) under concatenation. This clearly arises 
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from the fact that upon encoding, a waiting period is replaced with an error correction step, with 
all its (noisy) one- and two-qubit gates. One might think, then, that it might be desirable to skip 
error correction upon concatenation of a memory location. While this may indeed be possible, 
it raises a danger that would require more extensive analysis to assess: since single errors would 
go uncorrected, the error rate of qubits fed into the following rectangle would be greater. A 
much more careful calculation of the effects of these passed-on errors would need to be done to 
determine if skipping error corrections would in fact be helpful. 

Finally, we wish to note that the quantities a and (3 are actually quite close to one near the 
threshold values of the failure rates. For (3, we can understand this in the following way: the 
probability of getting a nonzero syndrome, 1 — (3, is roughly the probability for a single fault 
among iV syn locations which make the syndrome nonzero, i.e. we can approximate it as iV syn 7. 
For this argument we forget about any distinctions between types of errors and types of rectangles, 
so the iV syn is some mean number of locations, and 7 is some average failure rate. Now, a rough 
estimate of the threshold 7 (see Sec. IIII Bl) is 1/ ( 2 ) where N is the number of locations that can 
cause errors on the data, see Table ITT1 When N ~ N syn which is the case, we have f3 ~ 1 — 2/N. 
Since iV is somewhere between 100 and 200, we conclude that (3 should be well above 90%, and 
this is what we see. A similar discussion can be given for a. In some cases, at the pseudothreshold, 
the values of a and (3 are much smaller. 

VI. THE LOCAL MODEL WITH THE [[7, 1, 3]] CODE 

There are two main modifications that take place if we demand that all gates be local. First, 
each error correction procedure needs to be modified so that it only consists of local gates. In this 
paper, we do not consider the additional overhead that is incurred from making the error correction 
local. Second, we have to use the local replacement rules as given in Figs. l3land fTTI - fT5l 

The typical values for the scale factor r, which we will vary in our numerical analysis, can 
be estimated by considering how many qubits are in the error correction routine. For a nonlocal 
routine this number of qubits (which includes one block of data qubits) is k = 7 + 2 x n rep (7 + 3). 
In the regime (which we have found to be the relevant regime in the nonlocal numerical study) 
where a — > 1, n rep — > 4, this gives k = 87. Note that we count both the ancillas in X and Z 
since the X ancillas will be prepared during the Z routine. By making the error correction local 
(for example by using dummy qubits) this number will increase somewhat. Thus it seems that 
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taking r in the range of 10-100 may be reasonable (for a two-dimensional architecture we may 
take r m \yk~\ which would give r = 10). The operations that move qubits around over distance 
r are composed from operations that move over distance d, where r = rd and r is some integer. 
We assume that the failure probability scales linearly with distance (which is a good assumption 
for small errors), i.e. if a move(d) operation has failure probability ^ m d then a move(r) operation 
has failure probability 7 mr = T^ md . 

As it turns out, in Steane's error-correcting procedure, there are almost no one-qubit gates that 
occur in parallel with a two-qubit gate. The only exception is the preparation of the verification bits 
in the state |+) that occurs during Q, but these can be prepared at the last convenient moment. This 
implies that the computation is always a sequence of move gates followed by local 'in situ' gates. 
The modelling in Section llil Al shows that there are two types of wait locations, ones that originally 
occur while a two-qubit gate occurs and ones that occur during a one-qubit local gate. The wait 
locations of the first type get mapped onto much longer wait and error correction procedures, since 
they have to wait until the data has been moved. We also assume that data has to be moved back 
in place for the next gate, but it may be more efficient to move it elsewhere so that it is ready for a 
possible next nonlocal gate. 

In the upcoming analysis, we distinguish between the failure probabilities for composite and 
elementary rectangles denoted as 7 c (n) and 7 e (n). For n = 0, we of course have 7 C (0) = 7 e (0). 
We enumerate the types of locations and their probabilities in Table II VI 



Location 


Description 


Failure Prob. 


1 


one-qubit gate 


7i 


2 


two-qubit gate 


72 


wl 


wait during one-qubit gate 


Jwi 


w2 


wait during two-qubit gate 




md 


move distance d 


7md 


wd 


wait during move(d) 


Iwd 


lm 


one-qubit gate + measurement 


7lm 


P 


preparation 


7p 



TABLE IV: Types of locations and their failure probability symbols in the local analysis. 
We now discuss the required modifications of the nonlocal model as compared to the local 



30 



analysis with the [[7, 1, 3]] code. 

A. Modifications In The Failure Probability Estimation 

Each location / gets replaced by a composite 1 -rectangle denoted as Rf containing more than 1 
elementary 1-rectangle, denoted as Rj. In order for the composite rectangle to fail at least one of 
the elementary rectangles has to fail, or 

7 f(n) = l-n,-| i6 i?(l-^(n)), (2V) 

where the failure probabilities 7j(n) are calculated similarly as in the nonlocal model (see Eqs. 
© - (O). Table M lists the occurrences of elementary 1-rectangles in composite 1-rectangles. 
The elementary failure probabilities 7|(n) are again functions of the vector of composite failure 
probabilities 7 c (n — 1), i.e. 7?(n) = ¥' j(j c {n — 1)). 



1 


j | R) E Rf 


1 


1[1] 


2 


movc(d)[2r = 2r/d], wait(d)[2r], 2[1] 


lm 


lm[l] 


P 


P[l] 


move(d) 


move(d)[r] 


wait(d) 


wait(d)[r] 


wl 


wl[l] 


w2 


wait(d)[2r], w2[l] 



TABLE V: Each location I becomes a set of 1-rectangles by concatenation. The table lists which types 
of elementary 1-rectangles are present in the composite 1-rectangle Rf based on the replacement rules of 
Figs. l^1and [TTHT?1 The number between [] indicates how often the elementary 1-rectangle occurs inside the 
composite 1-rectangle. 

Now we list the necessary modifications to the failure probability of an elementary rectangle 
and the estimation of a, the probability of an ancilla passing verification, and (3, the probability 
of obtaining a zero syndrome. Note that the failure probability is now a function of the composite 
failure probabilities at the lower level. First we list the modifications to Table HT1 in Table IVT1 In 



A Modifications In The Failure Probability Estimation 
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the source 'errors due to propagation from the ancilla', we also need to use a modified a, (3, and 
P(pass and no X), estimated in the next section. 



Modified Source 


5 


N 


Memory faults on data at the end of S 


Iwl 


lAiSr + Sr) 


Memory faults on data during 1Z 


Iwl 




Memory faults (wl) on data when s = 1 


Iwl 


U(s-l)(5 Sz>1 +5 Sxtl ) 


Memory faults (w2) on data when s = 1 


Iwl 


7(s-l)(S SzA + 5 SxA ) 


X errors on ancillas waiting (wl) for S 


Iwl 


Us(s-l)(S Sz , s + S Sx , s )/2 


X errors on ancillas waiting (w2) for S 


lw2 


7s{s-l)(S SztS + S Sx , s )/2 



TABLE VI: Modified memory sources of failure and their contribution to the failure probability. We only 
list the sources that are different due to the distinction between wl and w2, the other sources are unchanged. 

For a rectangle that acts on a single block, i.e. I = p,wl,w2, 1, lm, move(d), wait(d), we 
write, similar to Eq. © 

7 r(n)=/5V / [l,l](T(n-l)) + 2/5(l-/3)F^[ S ,l](T(n-l)) + (l-/5) 2 FV S , S ](r(n-l)), (28) 
where the function F'i takes into account the modifications in the failure sources. 

B. Modifications in a and [5 

In each of the expressions in Section ITV CI (see Eqns. <TT4b - (1261) ) , we have to use the failure 
probabilities of the composite rectangles. Equations (IT4T> and (fT3l change due to the distinction 
between wl and w2 locations: 

P(pass and no Z) = (1 - 7 p 4 (l - 7i C ) 4 (l - 2 7l c m /3) 4 x 
IW1 - ^) NiieG) (i ~ 27^/3) 12 (l - 2 7 : 1 /3) 14 (1 - 7 ^) 6 (1 - 72 c ) 13 , (29) 

and, slightly different, 

P(pass and no X) = (1 - 2 7 ^/3) 4 (l - 2 7l c /3) 4 (l - 2 7l c m /3) 4 x 
n i6 g(l - 2^/3)^(1 - 2 7 ^ 2 /3) 18 (l - 2 7 ^/3) 14 (l - 72 c ) 13 , (30) 



B Modifications in a and (3 
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Note that in Eq. (I2TT) we maximize over all possible locations in this new model. We also 
distinguish between wl and w2 in Eq. (l26l) : 

P(no X err. on waiting data) = (1 - 2 7 ^ 1 /3) 14(s " 1) (l - 2 7 ^ 2 /3) 7(s " 1) . (31) 

VII. NUMERICAL THRESHOLD STUDIES FOR THE LOCAL MODEL 

By numerical iteration of the equations of the preceding sections, we study the repeated appli- 
cation of the map determined by encoding with the [[7, 1, 3]] code in the local model. Although we 
now have an even higher-dimensional map than in the nonlocal studies (eight dimensions rather 
than five), the two cases are mathematically very similar; it is evident that the structure of the flows 
is again determined by the presence of an unstable fixed point with one positive eigenvalue (and in 
this case 8 — 1 = 7 negative eigenvalues). An important difference is that the local map contains a 
free parameter, r, the frequency of error correction while moving; we will exploit the freedom to 
optimize the fault-tolerance threshold in the numerical studies below. 

Our first observation, illustrated in Fig. [U is that the numerical values of the threshold failure 
probabilities can in fact be strongly affected by the need to transport qubits. For this figure we 
take physical failure rates 71 = 72 = 7m = 1 P , Iwi = 7w2 = 0.1 x 72, j wd = 0.1 x ^ md , and 
Imd = t/t x 72 = d x 72. In words, this means that the gate, measurement, and preparation failure 
rates are taken all equal, wait errors (per unit time or per unit distance travelled during moving 
periods) are one-tenth of the gate failure rate, and moving a qubit over a unit distance is as noisy 
as a gate operation (corresponding to a scenario, say, in which moving over unit distance requires 
an actual swap gate execution). We have also optimized r to be r = 4, that is, d = [(r/r)] = 13, 
which means that error correction is performed on qubits in transit once every 13 units of distance 
moved (13 swap gates, say). 

As Fig. H] shows, for these conditions the threshold (we plot the 72 threshold value) decreases 
strongly with r; the dependence is very close to ^ hresh oc 1/r, confirming the analysis in Section 
HUB I Note however findings are more optimistic here than the analytical lower bound in Section 
IIIIB1 that is, we see that -y* hresh ' loc ~ ^thresh.nonioc x c j r some cons t an t c which is a bit larger 
than 1 . For a scale parameter r = 20, which could well be a reasonable number, we get ^ hresh - = 
0.73 x 10~ 4 , nearly an order of magnitude below the numbers typical in the nonlocal model, shown 
in Fig. 13 We have plotted these results in the high noise limit, but we have found similar behavior 
when the noise during transit is not very high, as seen in the dependence on r in Fig. [10] for small 
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FIG. 8: Gate failure rate threshold versus the scale parameter r for the local model. We have taken 71 = 
72 = 7m = 7 P » Iwi = Jw2 = 0.1 x 72, j wd = 0.1 x -y md , and j md = r/r x 72. r, the frequency with 
which a qubit is error-corrected while being moved over distance r is optimized in every case. The threshold 
follows very close to a \jr dependence. 



e. 

Fig. |9] shows the result of varying r for the failure probability choices of Fig. [U with fixed 
r = 50. We do this by choosing a r that minimizes the threshold probability. After that we fix r to 
be the optimal value, that is we do not adjust r at each level of concatenation. While the threshold 
value is not a very strong function of r, it is clearly optimal for r = 4. In more general studies in 
which we vary the initial values for 7 and r we do not find a simple relation between the optimal 
r and these parameters. 

This result was initially surprising to us, since it says that it is optimal to allow the moving 
qubits to become about thirteen times noisier than the qubits involved in gate operations before 
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FIG. 9: Gate failure rate threshold versus r, the frequency of error correction of a transported qubit, for 
r = 50. As in Fig. El we have taken 71 = 72 = 7™ = y p , j w i = 7^2 = 0.1 x 72, j wd = 0.1 x -f md , and 
Imd = r/r x 72. While not very strongly r-dependent, the optimal threshold occurs at r = 4. 



they are error corrected. The explanation for this seems to be that since qubits in motion do not 
have a chance of spreading error to other data qubits, allowing them to get noisier is not dangerous, 
and is actually desirable given the level of errors introduced by the error correction step itself. Of 
course, before they couple to other qubits we perform an error-correcting step in order to get rid of 
the accumulation of errors. A similar choice of less frequent error correction may be advantageous 
for a qubit who undergoes a few one-qubit and wait locations in succession. In such a case, errors 
do not spread to other blocks during these procedures and we finish the sequence by an error- 
correcting step as in the qubits in transit case. 

Finally, Fig. [TO] shows the result of varying between noiseless moving and high-noise moving 
scenarios. This is captured by varying the parameter e in the setting j md = er/r x 72. The 
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FIG. 10: Gate failure rate threshold versus e, a parameter that measures the relative noise rate per unit 
distance for a qubit being moved. We initially set 71 = 72 = 7m = 7p> 7u>i = 7u>2 = 0.1 x 72, 
Iwd = 0.1 x r/r x 72, and 7^ = er/r x 72. Scale parameters r equals 20, 50, and 80 are studied. At 
every point r is re-optimized. The dependence on e is slow, evidently slower than 1/e. 



choice for ^ md reflects the idea that the failure rates for qubits that are waiting during a move step 
should depend only on the distance moved (and therefore, the time waiting during each elementary 
move step), e = 1 is exactly the scenario explored in Figs. [S]and|5| e = corresponds to free 
moving, in which the qubit can be converted into some noiseless flying form for transportation; 
a rather artificial feature of this limit is that waiting is then noisier than moving. In Fig. [lOl 
the other parameters are initially set as: 71 = 72 = 7m = 7 P > Iwi = 7w2 = 0.1 x 72, and 
j wd = 0.1 x r/r x 72. 

It is evident from this that the error threshold is a weaker function of the moving failure rate 
e than it is of the scale parameter r. When e — > the waiting during moving is more error- 
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prone than the moving itself and this waiting should be the main cause of the 1/r behavior in this 
limit. In other words it is the scale-up of the circuit with every level of concatenation and the 
additional waiting this causes, i.e. a wait(d) location gets replaced by r wait(d) locations, that is 
the dominant reason why the threshold is lower than in the nonlocal model. 

On the other hand, the "weak" dependence on e seems to indicate that repeated error correction 
during moving is able to maintain acceptable fidelity for the moved qubits even in the face of 
moving errors. 

This gives some new hope for schemes, such as those involving spins in semiconductors or 
Josephson junctions, in which qubit moving is inherently as difficult as gate operations. We know 
that in such a high failure-rate regime, entanglement distribution followed by purification and then 
teleportation, can be a more effective way of moving qubits BatZlLLZD- The rather strong sensitivity 
to r that we find (Fig. [8]) suggests that if such strategies are employed, they should best be used in 
a way which does not increase the number of ancillas needed, and hence the scale parameter, too 
much. 

Our numerics of course add a note of caution to this optimism: although the e dependence we 
find is not too severe, over most of the range of the plot in Fig. [TOl the actual values of the fault- 
tolerance threshold failure rate is well below 10~ 4 , in a range that is presently far, far beyond the 
capability of any quantum computer prototype in the laboratory. 



VIII. OUTLOOK 



We see at least two extensions of this direction of research. One is to indeed make the error 
correction routine local, assuming some mechanism for short-distance transportation and a spatial 
layout of the qubits. We could then redo our local analysis, possibly with some more lengthy 
analysis of the failure probability that includes more details, in order to get a full estimate of the 
change in threshold due to locality. Secondly, one needs to consider where all the additional error 
correction in transit and moving will take place and has to design a layout for this. Given this 
layout there may be modifications to the replacement rules in order to reflect the real architecture. 
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APPENDIX A: REPLACEMENT RULES 
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FIG. 11: The replacement rule for a one-qubit gate location U or a waitl location. The dashed box rep- 
resents a 1 -rectangle. £ represents the error correction procedure. U represents the local fault-tolerant 
implementation of U. Note that in each figure, a qubit in M ra _i is encoded asm = 7 qubits in M n . 
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FIG. 12: The replacement rule for a one-qubit gate U followed by a measurement. 
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FIG. 13: The replacement rule for a wait2 (also called w2) location acting in parallel with a two-qubit gate. 
The replacement circuit contains three elementary 1 -rectangles. 
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FIG. 14: The replacement rule for a move(r) gate. The replacement circuit contains r elementary 1- 
rectangles. 
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FIG. 15: The replacement rule for a wait(r) gate. The replacement circuit contains r elementary 1- 
rectangles. 
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APPENDIX B: DEFINITIONS OF n-RECTANGLES, BLOCKS AND SPARSENESS 

• A set of qubits in M n is called an s-block if they originate from one qubit in M n _ s . A s- 
rectangle in M n is a set of locations that originates from one location in M„_ s . A s-working 
period is the time interval in M n which corresponds to one time step in M n _ s . 

• Let B be a set of n-blocks in the computation M n . An (n, k)-sparse set of qubits A in B is 
a set of qubits in which for every n-block in B, there are at most k (n — l)-blocks such that 
the set A in this block is not (n — 1, A;)-sparse. A (0, /c)-sparse set of qubits is an empty set 
of qubits. 

• A set of locations in a n-rectangle is (n, k)-sparse when there are at most k (n— 1) -rectangles 
such that the set is not (n — 1, A;)-sparse in that (n — l)-rectangle. A (0, A;)-sparse set of 
locations in a 0-rectangle is an empty set. A fault-path in M n is (n, A;) -sparse if in each 
n-rectangle, the set of faulty locations is (n, /c)-sparse. 

• A computation code C has spread t if one fault occurring in a particular 1 -working period 
affects at most t qubits in each 1 -block, i.e. causes at most t errors in each 1 -block in that 
particular working period. 
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APPENDIX C: ERROR-CORRECTING USING THE [[7, 1, 3]] CODE 
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FIG. 16: The Steane -error correction protocol, X [5]. The black circle represents control on a nonzero 
result. A white circle represents control on a zero result, s' represents a classical procedure to check if s' of 
the s syndromes agree. The dashed box procedure is applied only if the controlling syndrome is not zero. 
There are n rep prepared ancilla blocks. Each line represents 7 qubits. After V, a 'good' verification blocks 
remain. TZ represents the recovery procedure. 
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FIG. 17: The Q network for X or Z '-error correction [3]. The network can be executed in 5 time steps. It 
produces the encoded |0) state. The boxed zero represents preparation of a |0) state. 
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FIG. 18: The V network 
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14], executable in 6 time steps. The boxed zero represents preparation of the 
|0) state. The state |0) is the seven-qubit encoded |0) state. If each measurement output is 0, then the ancilla 
block is deemed 'good', that is, it has been checked for X errors. The network is the same for the Z-&vcor 
correction procedure. 



APPENDIX D: GATE COUNTS 



We calculate the number of locations in the circuits Q, V, S, and 1Z for the recovery gate (see 
Figs. [T7J I18I19I) . Note that when recovery takes place, a one-qubit gate is executed on the data. 
We denote these numbers as N(i e Q), etc. 





1 


2 


wl 


w2 


lm 


P 


s 





7 


14 (on data) 





7 





Q 


3 


9 


4 


3 





7 


V 


4 


13 


14 


15 + 3 


4 


4 


K 


1 





6 












TABLE VII: Number of locations of each type (1, 2, wl, w2, lm, or p) in individual routines Q, V, S and 
the recovery gates 1Z. The wl and w2 locations combined are simply called w locations in the nonlocal 
analysis. 
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FIG. 19: The syndrome network 5 for X-error correction [3]. This network can be executed in 3 time steps. 
Here EE represents classical error extraction. The network S for Z-error correction uses C x gates in place 
of C z gates, with the ancillas acting as control and the data as target qubits. 
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